Quantitative performance analysis of the SPEC OMPM2001 benchmarks
Abstract
Modern computer systems have evolved to allow easy access to multiprocessing by placing multiple processors on a single physical package. As multiprocessor hardware evolves, new ways of programming it are also developed; some of these merely adopt and standardize older paradigms. One such evolving standard for programming shared-memory parallel computers is the OpenMP API. The Standard Performance Evaluation Corporation (SPEC) has created a suite of parallel programs called SPEC OMP to compare and evaluate modern shared-memory multiprocessor systems using the OpenMP standard. We have studied these benchmarks in detail to understand their performance on a modern architecture. In this paper, we present detailed measurements of the benchmarks. We organize, summarize, and display our measurements using a Quantitative Model, and we present a detailed discussion and derivation of the model. We also discuss the important loops in the SPEC OMPM2001 benchmarks and the reasons for less-than-ideal speedup on our platform.
Related papers
Large System Performance of SPEC OMP2001 Benchmarks
Performance characteristics of application programs on large-scale systems are often significantly different from those on smaller systems. SPEC OMP2001 is a benchmark suite intended for measuring performance of modern shared memory parallel systems. The first component of the suite, SPEC OMPM2001, is developed for medium-scale (4- to 16-way) systems. We present our experiences on benchmark devel...
CCRG OpenMP Compiler: Experiments and Improvements
In this paper, we present the design and experiments of a practical OpenMP compiler for SMP, called CCRG OpenMP Compiler, with the focus on its performance comparison with commercial Intel Fortran Compiler 8.0 using SPEC OMPM2001 benchmarks. The preliminary experiments showed that CCRG OpenMP is a quite robust and efficient compiler for most of the benchmarks except mgrid and wupwise. Then, fur...
Memory Performance and SPEC OpenMP Scalability on Quad-Socket x86_64 Systems
Because of the continuous trend towards higher core counts, parallelization is mandatory for many application domains beyond the traditional HPC sector. Current commodity servers comprise up to 48 processor cores in configurations with only four sockets. Those shared memory systems have distinct NUMA characteristics. The exact location of data within the memory system significantly affects both...
Performance Analysis and Improvement of OpenMP on Software Distributed Shared Memory Systems
In this paper, the performance of the portable OpenMP compiler on SDSM JIAJIA is analyzed using SPEC OMPM2001 benchmark. The overheads of parallel execution have been investigated from the aspects of thread management and task schedule, memory access and synchronization. To improve the performance, the page placement and data privatization techniques have been implemented for the optimization o...
Supporting Realistic OpenMP Applications on a Commodity Cluster of Workstations
This paper describes compiler techniques that can translate standard OpenMP applications into code for distributed computer systems and optimization techniques to enhance the performance of realistic OpenMP application. OpenMP has emerged as an important model and language extension for shared-memory parallel programming. However, despite OpenMP's success on these platforms, it is not currently...
Journal:
- Scientific Programming
Volume 11
Publication year: 2003